1.1 What is Bioinformatics?

3

mitted, received, stored, processed, and interpreted in biological systems” or, more

succinctly, “the application of information science to biology”.

The emergence of information theory by the middle of the twentieth century

enabled the creation of a formal framework within which information could be quan-

tified. To be sure, the theory was, and to some extent still is, incomplete, especially

regarding those aspects going beyond the merely faithful transmission of messages, in

order to enquire about, and even quantify, the meaning and significance of messages.

In parallel to these developments, other advances, including the development

of the idea of algorithmic complexity, with which the names of Kolmogorov and

Chaitin are associated, allowed a number of other crucial clarifications to be made,

including the notion that randomness is minimally informative. The DNA sequence

of a living organism must depart in some way from randomness, and the study of

these departures could be said to constitute the traditional core of bioinformatics.

Although those who argue about the primacy of genes and their immortality might

disagree, biology is, ultimately, about organisms and a set of genes cannot even be

said to specify an organism. In order to survive—and if it did not it would no longer

be alive—an organism must continuously adapt to its environment. Phylogenetic

adaptation, in which the organism’s genes are modified in its descendants, is only one

part of the adaptive survival strategy. Equally indispensable is ontogenetic adaptation,

which involves networks of control and communication within the organism. A

structurally evident example of such a network is the nervous system, 2 of which the

simplest example is perhaps the nematode worm C. elegans, but even the simplest

unicellular organisms have signalling networks based on molecular interactions, and

such networks are also found in all higher organisms, including plants. 3 Shannon’s

theory is directly applicable to flows of information in the channels that connect the

nodes of these networks, and bioinformatics is also concerned with the information

processing that takes place within the nodes, with the overall architecture of the

networks. Given the relative ease with which DNA may be reliably sequenced, it is

understandable that deciphering the message of DNA has been the traditional core of

bioinformatics, but with the accumulation of relevant experimental data, elucidating

the architecture and operation of the signalling networks is becoming another core

of bioinformatics.

Alongside information theory, cybernetics developed as a distinctive science at

around the same time and largely within the same constellation. Its definition is well

conveyed by the subtitle of Wiener’s eponymous book (1948): “the study of control

and communication in the animal and the machine”. 4 The word itself was coined

by Ampère (as cybernétique) more than a century earlier. It is derived from the

Greek kappa upsilon beta epsilon rho nu eta tau zeta sigmaκυβeρνητζσ, meaning steersman, from which we get our Latin gubernetes,

2 The nervous system provides a good example of the inability of genes to specify essential features

of an organism. As Érdi and Barna have remarked (1984) the neural connexions are not specified,

but an algorithm to select favourable connexions is given genetically.

3 See, e.g., Thellier (2017).

4 Second-order cybernetics explicitly includes the observer within what is being studied (Heylighen

and Joslyn, 2001).